Multipass algorithm for acquisition of salient acoustic morphemes
نویسندگان
چکیده
We are interested in spoken language understanding within the domain of automated telecommunication services. Our current methodology involves training statistical language models from large annotated corpora for recognition and understanding. Since the transcribing of large speech corpora is a resource consuming task, we are motivated to exploit speech without transcriptions. In particular, we learn the semantic associations for a task exploiting only phone-based sequences from the output of a task-independent ASR-system. In this paper we present a new multipass algorithm for acquiring salient phone sequences from untranscribed speech corpora and evaluate their utility for the HMIHY task. Compared to our previous strategy, this algorithm is shown to produce improved call-classification results while reducing up to 7-fold the number of salient phone-sequences selected for training.
منابع مشابه
Accuracy Order of Grammatical Morphemes in Persian EFL Learners: Evidence for and against UG
This study addresses the acquisition of the morphological markers in Persian learners of English as a foreign language. To this end, the accuracy order of nine morphemes including plural –s, progressive –ing, copula be, auxiliary be, irregular past tense, regular past tense –ed, third person –s, possessive -ʼs and indefinite articles was studied in 6...
متن کاملErrors of Omission and Commission in Verbal and Nominal Inflectional Morphemes by Children with SLI: Phonological Effects and Acoustic Analysis
It has previously been shown that inconsistency in the early morpheme productions of typically developing (TD) children and those with Specific Language Impairment (SLI) can be partly explained by the phonological complexity of the coda. However, it is not yet known whether TD and SLI children have similar underlying processes of morpheme acquisition. Of particular interest is the reported late...
متن کاملRunning head: L1 INFLUENCE ON MORPHEME ACQUISITION ORDER 1 L1 Influence on the Acquisition Order of English Grammatical Morphemes: A Learner Corpus Study
We revisit morpheme studies to evaluate the long-standing claim for a universal order of acquisition. We investigate the L2 acquisition order of six English grammatical morphemes by learners from seven L1 groups across five proficiency levels. Data are drawn from approximately 10,000 written exam scripts from the Cambridge Learner Corpus. The study establishes clear L1 influence on the absolute...
متن کاملAn empirical study of multipass decoding for vietnamese LVCSR
In this paper, we represent an empirical study of multipass decoding for Vietnamese LVCSR. We report our experiments with N-best, lattice and consensus decoding on the VNBN data. Results from this study indicate that our acoustic model for Vietnamese was precise. The results could be investigated in further steps to improve the performance of our system. Index Terms Vietnamese, Acoustic Model, ...
متن کاملمقایسه روشهای مختلف یادگیری ماشین در خلاصهسازی استخراجی گفتار به گفتار فارسی بدون استفاده از رونوشت
In this paper, extractive speech summarization using different machine learning algorithms was investigated. The task of Speech summarization deals with extracting important and salient segments from speech in order to access, search, extract and browse speech files easier and in a less costly manner. In this paper, a new method for speech summarization without using automatic speech recognitio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001